Neil Zeghidour

PSL, FAIR, LSCP

CaReAQA: A Cardiac and Respiratory Audio Question Answering Model for Open-Ended Diagnostic Reasoning

May 02, 2025

Vision-Speech Models: Teaching Speech Models to Converse about Images

Mar 19, 2025

High-Fidelity Simultaneous Speech-To-Speech Translation

Feb 05, 2025

MAD Speech: Measures of Acoustic Diversity of Speech

Apr 16, 2024

MusicRL: Aligning Music Generation to Human Preferences

Feb 06, 2024

TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition

Aug 21, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen

Jun 22, 2023

SoundStorm: Efficient Parallel Audio Generation

May 16, 2023

LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models

Mar 23, 2023

Speech Intelligibility Classifiers from 550k Disordered Speech Samples

Mar 15, 2023